A partially observable Markov decision process (POMDP) is a generalization of a Markov decision process (MDP). A POMDP models an agent decision process in which it is assumed that the system dynamics are determined by an MDP, but the agent cannot directly observe the underlying state. Instead, it must maintain a probability distribution over the set of possible states, based on a set of observations and observation probabilities, and the underlying MDP.

The POMDP framework is general enough to model a variety of real-world sequential decision processes. Applications include robot navigation problems, machine maintenance, and planning under uncertainty in general. The framework originated in the operations research community and was later taken up by the artificial intelligence and automated planning communities.

An exact solution to a POMDP yields the optimal action for each possible belief over the world states. The optimal action maximizes (or minimizes) the expected reward (or cost) of the agent over a possibly infinite horizon. The sequence of optimal actions is known as the optimal policy of the agent for interacting with its environment.
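The belief maintenance described above is the standard Bayes-filter update: after taking action ''a'' and receiving observation ''o'', the new belief is b'(s') ∝ O(o | s', a) Σ_s T(s' | s, a) b(s). Below is a minimal sketch of this update in Python with NumPy; the function name update_belief and the array layouts for T and O are illustrative assumptions, not taken from the article.

<syntaxhighlight lang="python">
import numpy as np

def update_belief(belief, action, observation, T, O):
    """Bayes-filter update of the belief over hidden states.

    belief      -- current distribution over states, shape (|S|,)
    action      -- index of the action just taken
    observation -- index of the observation just received
    T           -- T[a][s, s2] = P(s2 | s, a), shape (|A|, |S|, |S|)
    O           -- O[a][s2, o] = P(o | s2, a), shape (|A|, |S|, |O|)
    """
    # Predict: push the current belief through the transition model.
    predicted = belief @ T[action]                    # shape (|S|,)
    # Correct: weight each state by the likelihood of the observation.
    unnormalized = predicted * O[action][:, observation]
    # Normalize so the belief remains a probability distribution.
    return unnormalized / unnormalized.sum()

# Toy usage: two hidden states, one action, two noisy observations.
T = np.array([[[1.0, 0.0],
               [0.0, 1.0]]])        # the action does not move the state
O = np.array([[[0.85, 0.15],
               [0.15, 0.85]]])      # observation is correct 85% of the time
b = np.array([0.5, 0.5])            # uniform initial belief
b = update_belief(b, action=0, observation=0, T=T, O=O)
print(b)                            # belief shifts toward state 0: [0.85 0.15]
</syntaxhighlight>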
==Definition==